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1 Introduction 

The determination of capacities of quantum channels in various settings has been a field of intense work 
over the last decade. In contrast to classical information theory, to any quantum channel we can associate 
in a natural way different notions of capacity depending on what is to be transmitted over the channel 
and which figure of merit is chosen as the criterion for the success of the particular quantum commu- 
nication task. For example we may try to determine the maximum number of classical messages that 
can be reliably distinguished at the output of the channel leading to the notion of classical capacity of 
a quantum channel. We might as well wish to establish secure classical communication over a quantum 
channel, giving rise to the definition of a channel's private capacity. 

On the other hand, in the realm of quantum communication, one may ask e.g. the question what the 
maximal amount of entanglement is that we can generate or transmit over a given quantum channel, 
leading to the notions of entanglement-generating and entanglement transmission capacities. Other ex- 
amples of quantum capacities are the subspace transmission and average subspace transmission capacities. 
Such quantum communication tasks are needed, for example, to support computation in quantum circuits 
or to provide the best possible supply of pure entanglement in a noisy environment. Fortunately, these 
genuinely quantum mechanical capacities are shown to be equal for perfectly known single user channels 

First results indicating that coherent information was to play a role in the determination of the quantum 
capacity of memoryless channels were established by Schumacher and Nielsen ^26j and, independently, by 
Lloyd |23| who was the first to conjecture that indeed the regularized coherent information would give the 
correct formula for the quantum capacity and gave strong heuristic evidence to his claim. In 1998 Barnum, 
Knill, and Nielsen and Barnum, Nielsen, and Schumacher [Ij gave the first upper bound on the capacity of 
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a memoryless channel in terms of the regularized coherent information. Later on, Shor and Devetak 
[To] offered two independent approaches to the achievability part of the coding theorem. Despite the fact 
that the regularized coherent information was identified as the capacity of memoryless quantum channels 
many other approaches to the coding theorem have been offered subsequently, for example Devetak and 
Winter [TT] and Hayden, Shor, and Winter [T3|. Of particular interest for our paper are the developments 
by Klesse [30] and Hayden, Horodecki, Winter, and Yard [13] based on the decoupling idea which can 
be traced back to Schumacher and Westmoreland [28]. In fact, the main purpose of our work is to show 
that the decoupling idea can be utilized to prove the existence of reliable universal quantum codes for 
entanglement transmission and generation. 

On the other hand, the classical capacity of memoryless quantum channels has been determined in the 
pioneering work by Holevo [T3] and Schumacher and Westmoreland [17] . Their results have been substan- 
tially sharpened by Winter [31] and Ogawa and Nagaoka [25] who gave independent proofs of the strong 
converse to the coding theorem. 

However, most of the work done so far on quantum channel capacities relies on the assumption that the 
channel is perfectly known to the sender and receiver. Such a requirement is hardly fulfilled in many 
situations. In this paper we consider compound quantum channels which are among the simplest non- 
trivial models with channel uncertainty. A rough description of this communication scenario is that the 
sender and receiver do not know the memoryless channel they have to use. The prior knowledge they have 
access to is merely that the actual channel belongs to a set 3 of channels which in turn is known to the 
sender and receiver. It is important to notice that we impose no restrictions on the set 3, i.e. it can be 
finite, countably-infinite or uncountable. Our intention is to identify the best rates of quantum codes for 
entanglement transmission and generation that are reliable for the whole set of channels 3 simultaneously. 
This is, in some sense, a quantum channel counterpart of the universal quantum data compression result 
discovered by Jozsa and the Horodecki family [T8] . 

While the classical capacity of compound quantum channels has been determined only recently in [3], in 
this paper we will focus on entanglement-generating and entanglement transmission capacities of com- 
pound quantum channels. Specifically we will determine both of them and show that they are equal. The 
investigation of their relation to other possible definitions of quantum capacity of compound quantum 
channels in spirit of [1], [21] will be given elsewhere. 

1.1 Related Work 

The capacity of compound channels in the classical setting was determined by Wolfowitz [351 [33] and 
Blackwell, Breiman, and Thomasian [5j . The full coding theorem for transmission of classical information 
via compound quantum channels was proven in ,3j. Subsequently, Hayashi [T^ obtained a closely related 
result with a completely different proof technique based on the Schur-Weyl duality from representation 
theory and the packing lemma from |7]. 

In our previous paper [4j we determined the entanglement transmission capacity of finite quantum com- 
pound channels (i.e. |3| < oo). Moreover, we were able to prove the coding theorem for arbitrary 3 with 
informed decoder. It is important to remark here that we used a different notion of codes in [4J , following 
[20j , which is motivated by the theory of quantum error correction. In the cases of an informed decoder 
and uninformed users this change does not appear to be of importance. In the case of an informed encoder 
it is of crucial importance in the proof of the direct part of the coding result. 

In our former paper, the strategy of proof was as follows. First, we derived a modification of Klesse's one- 
shot coding result [20] that was adapted to arithmetic averages of channels. Application of this theorem 
combined with a discretization technique based on r-nets yielded the coding result for quantum compound 
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channels with informed decoder and arbitrary 3. 

With the help of the channel-estimation technique developed by Datta and Dorlas [9] we were able to show 
that in the case of a finite compound channel it is asymptotically of no relevance if one spends the first 
transmissions for channel estimation, thus turning an uninformed decoder into an informed decoder. 
Since for an informed decoder we had already proven the existence of good codes, we were able to obtain 
the full coding result in the case \3\ < oo. 

Unfortunately, the speed at which one can gain channel knowledge using the channel estimation technique 
we employed is highly dependent on the number of channels. Due to this fact, the combination of channel 
estimation and approximation of general compound channels through finite ones did not seem to work in 
the other two cases. 

In this paper, we use a more direct strategy. First, we derive one-shot coding results for finite compound 
channels with uninformed users and informed encoder. In order to evaluate the dependence of the derived 
bounds on the block length we have to project onto typical subspaces of suitable output states of the indi- 
vidual channels. Therefore, it turns out that we effectively end up in the scenario with informed decoder. 
Now, instead of employing a channel estimation strategy we study the impact of these projections onto 
the typical subspaces on the entanglement fidelity of the entire encoding-decoding procedure. It turns 
out that these projections can simply be removed without decreasing the entanglement fidelity too much 
and we have got a universal (i.e. uninformed) decoder for our coding problem. Then, again, using the 
discretization technique based on r-nets we can convert these results for finite 3 to arbitrary compound 
quantum channels. 

Another difference to our previous paper [4] is that we determine the optimal rates in all the scenarios de- 
scribed above for entanglement generation over compound quantum channels and show that they coincide 
with the entanglement transmission capacities. 

1.2 Outline 

Section [2] contains the fundamental definitions of codes and capacities for entanglement transmission in 
all three different settings. Moreover, the reader can find there the statement of our main result. 
It is followed by a section on one-shot results containing the one-shot result of Klesse [20] , as well as our 
modifications thereof. The modified coding results guarantee the existence of unitary encodings as well 
as recovery operations for finite arithmetic averaged channels in all three different cases and establish a 
relation between the rate of the code and its entanglement fidelity. We also give an estimate relating the 
entanglement fidelity of a coding-decoding procedure to that of a disturbed version, where disturbance 
means that the application of the channel is followed by a projection. 

With these one-shot results at hand, in Section [5] we are able to prove the existence of codes for entan- 
glement transmission of sufficiently high rates and entanglement fidelity asymptotically approaching one 
exponentially fast in the case of finite compound channels. 

Section [5] states the basic properties of finite size nets in the set of quantum channels. They are used to 
approximate general sets of quantum channels and provide the link between finite and general compound 
channels. The construction is such that their size depends polynomially on the approximation parameter. 
We use the coding results for finite compound channels and the properties of finite nets in section [5] to 
derive sharp lower bounds on the entanglement transmission capacity of general compound channels. This 
section also contains variants of the BSST Lemma [2] where BSST stands for Bennett, Shor, Smolin, and 
Thapliyal. The proofs rely heavily on the difference in the polynomial growth of nets versus exponentially 
fast convergence to entanglement fidelity one for the codes in the finite setting. 

The next section [7] contains the converse parts of the coding theorems for general compound channels. 
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Since the converse must hold for arbitrary encoding schemes and since we expHcitly allow the code space 
to be larger than the input space of the channels, we deviate from the usual structure and instead employ 
the converse part for the case of entanglement generation that was developed by Devetak ^2] • We also use 
a recent continuity result due to Leung and Smith [22] that connects the difference in coherent information 
between nearby channels. 

In section [5] we show, once again using the work of Leung and Smith [22 , that the entanglement trans- 
mission capacities of compound quantum channels are continuous with respect to the Hausdorff metric. 
In the final section [5] we apply the results obtained so far to determine the entanglement-generating ca- 
pacities of compound quantum channels. It is not very surprising that it turns out that they coincide with 
their counterparts for entanglement transmission. 

1.3 Notation and Conventions 

All Hilbert spaces are assumed to have finite dimension and are over the field C. 5(7Y) is the set of states, 
i.e. positive semi-definite operators with trace 1 acting on the Hilbert space TL. Pure states are given 
by projections onto one-dimensional subspaces. A vector of unit length spanning such a subspace will 
therefore be referred to as a state vector. To each subspace of 7i we can associate unique projection qjr 
whose range is the subspace T and we write tt^t for the maximally mixed state on J- , i.e. ttjt fj^^^. 
The set of completely positive trace preserving (CPTP) maps between the operator spaces B{H) and B{K,) 
is denoted by C{H,IC). Thus H plays the role of the input Hilbert space to the channel (traditionally 
owned by Alice) and IC is channel's output Hilbert space (usually in Bob's possession). C^{Ti, JC) stands for 
the set of completely positive trace decreasing maps between B{T-L) and B{JC). ii{H) will denote in what 
follows the group of unitary operators acting on H. For a Hilbert space G C TC we will always identify 
ii{G) with a subgroup of il{H) in the canonical way. For any projection q e B{H) we set := Ifi — q. 
Each projection q G B{Ti.) defines a completely positive trace decreasing map Q given by Q{a) := qaq for 
all a S B{H). In a similar fashion any u G ii{H) defines aU G C{H,H) hyU{a) := uau* for a e B{H). 
We use the base two logarithm which is denoted by log. The von Neumann entropy of a state p £ S{H) 
is given by 

S{p) -tr(plogp). 
The coherent information for J\f e C{H, JC) and p e S{TC) is defined by 

h{p,Af) ■■= S{Af{p)) - S{{idn®Af)mm), 

where ip € Ti. ® Ti. \s ai\ arbitrary purification of the state p. Following the usual conventions we let 
Se{p,Af) ■= S{{id-H(^J^){\'4'){i^\)) denote the entropy exchange. A useful equivalent definition of /c(p,A/') 
is given in terms of A/" € C{TL,JC) and the complementary channel M' G C{'H,'He) where He denotes 
the Hilbert space of the environment: Due to Stinespring's dilation theorem A/" can be represented as 
M{p) — tr-H^ (vpv*) for p G S{Ti.) where w : 7i — > /C (X" Ti-e is a linear isometry. The complementary channel 
A/"' G C(H, He) to AA is given by 

Af'ip) -.^trniypv*) (peSiH)). 
The coherent information can then be written as 

Ie{p,U)=Smp))-S{U'{p)). 
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As a measure of closeness between two states p,a £ S{H) we use the fidelity F{p,a) := H-yPv^Hi. The 

fidelity is symmetric in the input and for a pure state p — \(j)){(t)\ we have u) = {(p, a(p). 

A closely related quantity is the entanglement fidelity. For p e S{Ti) and J\f G C^{H, JC) it is given by 

F,{p,Af) ■■= {i^.{idn®Um)mi'), 
with ip € Ti. IS) Ti. being an arbitrary purification of the state p. 

For the approximation of arbitrary compound channels by finite ones we use the diamond norm || • ||o, 
which is given by 

||AA||o:=sup max | |(id„ ® AA)(a)| |i, 

where id^ : S(C") B{C^) is the identity channel, and Af : BCH) — > B{JC) is any linear map, not 
necessarily completely positive. The merits of || • ||^ are due to the following facts (cf. [H]). First, 
||A/'||o = 1 for all Af e C{H,JC). Thus, C{Ti.,JC) C S^, where denotes the unit sphere of the normed 
space {B{B{n),B{IC)), \\ ■ ||o). Moreover, \\J\fi ^MHo = l|M||oll-^2||o for arbitrary linear maps7Vi,AA2 : 

B{n)^B{K:). 

We further use the diamond norm to define the function Do{-,-) on {{3,3') : 3,3' <Z C{H,IC)}, which is 
for 3,3' C C{n,IC) given by 

I^Opi^') := max{sup inf ||7V-7V'||o, sup inf ||7V-7V'||o}. 

For 3 C C{n,JC) let 5 denote the closure of in || • H^. Then defines a metric on {{3,3') : 3,3' C 
C{Ti,JC), 3 = 5, 3' = 3'{ which is basically the Hausdorff distance induced by the diamond norm. 
Obviously, for arbitrary 3, 3' C Cili., K.), D(){3, 3') < e implies that for every A/" G J (A/"' G 3') there exists 
Af' G 3' {Af G 3) such that \\Af ~ Af'\\o < 2e. U3 = 3, 3' = 3' holds we even have ||7V - 7V'||o < e. In 
this way Do gives a measure of distance between two compound channels. 
Finally, for any set 3 C C {Ti , JC) and Z G N wc set 

j& .= : AA G 3}. 



2 Definitions and Main Result 

Let 3 C C{Ti,K,). The memoryless compound channel associated with 3 is given by the family {A/"®' : 
iS(7i®') '5(/C®')};gN^A/-(=3. In the rest of the paper we will simply write 3 for that family. 
Each compound channel can be used in three different scenarios: 

1. the informed decoder 

2. the informed encoder 

3. the case of uninformed users. 

In the following three subsections we will give definitions of codes and capacity for these cases. 
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2.1 The Informed Decoder 

An {I, ki)-code for 3 with informed decoder is a pair {V\ {T^\j- ■ A/" G 3}) where: 

1. : B{J^i) B{H)®^ is a CPTP map for some Hilbert space Ti with ki = dimjT;. 

2. : B(/C)®' ^ S(:F/) is a CPTP map for each Af g3 where the Hilbert space satisfies J^i C JT/. 

In what follows the operations 7?.Jy- are referred to as recovery (or decoding) operations. Since the 
decoder knows which channel is actually used during transmission, they are allowed to depend on 
the channel. 

Note at this point that we deviate from the standard assumption that J^i = T'l- We allow J^i C J^'^ for 

convenience only since it allows more flexibility in code construction. It is readily seen from the definition 
of achievable rates and capacity below that the assumption J^i C cannot lead to a higher capacity of 3 
in any of the three cases that we are dealing with. 

A non-negative number R is called an achievable rate for 3 with informed decoder if there is a sequence 

of (/, A;;)-codes such that 

1. liminfj^oo j logfc; > i?, and 

2. lim;^oo infATes i^e(7r^, , 7^V o TV®' o p') = 1 
holds. 

The capacity Qid{3) of the compound channel 3 with informed decoder is given by 

Qioi^) '■= sup{i? e M+ : R is achievable for 3 with informed decoder}. 

2.2 The Informed Encoder 

An (I, ki)-code for 3 with informed encoder is a pair {{V\f : Af G 3},TZ^) where: 

1. T'l/- : B{Ti) -> B{H)'^^ is a CPTP map for each TV G J for some Hilbert space J"; with ki = dimJT,. 
The maps V]^ are the encoding operations which we allow to depend on TV since the encoder knows 
which channel is in use. 

2. : B(/C)®' ^ B{T[) is a CPTP map where the Hilbert space T[ satisfies Ti C 

A non-negative number R is called an achievable rate for 3 with informed encoder if there is a sequence 

of (/, fc;)-codcs such that 

1. liminf;_>oo jlogfc; > R, and 

2. lim,^^ infATe:, F,{-k^, , V) o A/"®' o pj^) = 1 
holds. 

The capacity Qie{3) of the compound channel 3 with informed encoder is given by 

Qie{3) := sup{ii e M+ : R is achievable for 3 with informed encoder}. 
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2.3 The Case of Uninformed Users 

Codes and capacity for the compound channel 3 with uninformed users are defined in a similar fashion. 
The only change is that we do not allow the encoding operations to depend on Af. I.e. An {I, ki)— code 
for a is a pair (P^7^') of CPTP maps V'' G C(J^i,H®0 where Ti is a Hilbert space with ki = dimTi and 

7^' e c{k:^',j^i) with Ti c t;. 

A non-negative number R is called an achievable rate for 3 if there is a sequence of {I, fc;)-codes such that 

1. liniinf;^oo j logfc; > R, and 

2. lim^^ooinfAAea-Fe(7r^,,7^' oAA«5' oT'O = 1. 

The capacity QCJ) of the compound channel 3 is given by 

Q{3) := sup{i? G M-|_ : R is achievable for 3}. 
A first simple consequence of these definitions is the following relation among the capacities of 3. 

Q{3) <mm{QiD{3),QiE{3)}. 

2.4 Main Result 

With these definitions at our disposal, we are ready now to state the main result of the paper. 

Theorem 1 Let 3 C C{TL,1C) he an arbitrary set of quantum channels where Ti. and K. are finite dimen- 
sional Hilbert spaces. 

1 . Then 

Q{3) = Qid{3) = lim \ max inf I^{p,N®^), 
i^oc I pe5(w®')^e3 

and 

Qie{3) = lim \ inf max /,(p,A/'®'). 

2. Moreover, for the corresponding entanglement- generating capacities E{3), Eid{3), and Eie(3) (de- 
fined in Section\^ we have 

E(3)^Eid{3)=Q{3) 

and 

Eie{3) = Qie{3). 
The rest of the paper contains a step-by-step proof of Theorem [1] 

3 One- Shot Results 

In this section we will establish the basic building blocks for the achievability parts of the coding theorems 
for compound channels with and without channel knowledge. The results are formulated as one-shot 
statements in order to simplify the notation. 
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3.1 One-Shot Coding Result for a Single Channel 

Before we turn our attention to quantum compound channels we will shortly describe a part of recent 
developments in coding theory for single (i.e. perfectly known) channels as given in [5D] and [13] ■ Both 
approaches are based on a decoupling idea which is closely related to approximate error correction. In 
order to state this decoupling lemma we need some notational preparation. 

Let p e Sili.) be given and consider any purification tp g Ha^^Ti-, Ha = Ti., of p. According to Stincspring's 
representation theorem any Af G C^{'H,JC) is given by 

Af{-)^trnAi^H(^PeH-)v*), (1) 

where Tie is a suitable finite-dimensional Hilbert space, Pe is a projection onto a subspace of Tie, and 
V :T-L K. ® He is an isometry. 

Let us define a pure state on Ha (X" /C ^ He by the formula 
We set 

P tl-H„(»He(l'0'>('/''l), Pae — ( (V'' I ) , 

and 

Pa ■■= tr/c®W^(|?/'')('/''l), Pe — trWa(8/c(|l/'')(V''l)- 

The announced decoupling lemma can now be stated as follows. 

Lemma 2 (Cf. |20] . |13p For p e S{H) and M E C^{H,IC) there exists a recovery operation TZ e 
C{JC,H) with 

Fe{p,'R.oJ\f) > W - ||W/0'^e - Wpa ® Pe 1 1 1 : 

where w = tr{M(p)). 

The striking implication of Lemma[2]is that if the so called quantum error \ \p'ae — Pa® p'eWi for p G S{H) 
and Af € C{H, JC) is small then almost perfect error correction is possible via TZ. 

Lemma was Klesse's |20j starting point for his highly interesting proof of the following theorem which is 
a one-shot version of the achievability part of the coding theorem. In the statement of the result we will 
use the following notation. 

i^e,e(p,AA) max Fe{p,noM), 
' nec(K.,n) 

where pe5(Ti) and TV G C^Ti, /C). 

Theorem 3 (Klesse [20j) Let the Hilbert space H be given and consider subspaces £ <Z Q <Z H with 
dimf = k. Then for any N G C^{H, JC) allowing a representation with n Kraus operators we have 



Fe,e{uTT£U*,Af)du > tr{Af{TTg)) - Vk^\\Af{TTg)\\2, 

where ii{G) denotes the group of unitaries acting on Q and du indicates that the integration is with respect 
to the Haar measure oni!i{Q). 
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We will indicate briefly how Klesse [5D] derived the direct part of the coding theorem for memoryless 
quantum channels from Theorem [3l Let us choose for each I £ N subspaces £i C G"^^ C 7i®' with 

To given A/" € C{H, JC) and Trg Klesse constructed a reduced version Ni of A/"®' in such a way that Ni has 
a Kraus representation with ni < 2'('^'=('^C'^)+^) Kraus operators. Let qi g B{IC^'') be the entropy-typical 
projection of the state (A/'(7rg;))®' and set A/'/(-) :— qiNi{-)qi. Then we have the following properties (some 
of which are stated once more for completeness) 

1. ki = 2'(^c(7rgX)-3£)^ 

2. tr(AA/(7r|'))>l-o(/ofl 

3. < 2^{s.(^gM)+'i)^ and 

4- l|A/i'(7r|')||i < 2"'(^('^s)-^) 

An application of Theorem[3]to A/"/ shows heuristically the existence of a unitary u G and a recovery 

operation Ui e C{IC'»\H'^^) with 

F,(u7r£,u^7^^ o AA/) > 1 - o(/°) - 2~i\ 

This in turn can be converted into 

Fe(w^£,u^7^^oAA«5') > l-o(/0), 

which is the achievability of Ic{TTg,M). The passage from ng to arbitrary states p is then accomplished 
via the Bennett, Shor, Smolin, and Thapliyal Lemma from [2] and the rest is by regularization. 

3.2 One-Shot Coding Result for Uninformed Users 

Our goal in this section is to establish a variant of Theorem[3]that works for finite sets of channels. Since the 
entanglement fidelity depends affinely on the channel it is easily seen that for each set 3 = {A/i, . . . jA/jv} 
any good coding scheme with uninformed users is also good for the channel 

1 ^ 

i=l 

and vice versa. Since it is easier to deal with a single channel and we do not loose anything if passing to 
averages we will formulate our next theorem for arithmetic averages of completely positive trace decreasing 
maps instead of the set {A/i, . . . , A/tv}- 

Theorem 4 (One-Shot Result: Uninformed Users and Averaged Channel) Let the Hilbert 
space Ti be given and consider subspaces £ d G d Ti. with dimS — k. For any choice of 

^Here, o(i'') denotes simply a non-specified sequence tending to as Z — > oo, i.e. we (ab)use the Bachmann-Landau little-o 
notation. 
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A/i,...A/7v G C^(7i,/C) each allowing a representation with nj Kraus operators, j = 1,...,N, we 
set 

1 " 



N 



and and for any u G •U(^) 



TV 

/ F,^,{ne,Nu)du > tr{U{TTg))-2y2^k^\\J\fj{TTg)\\2, 
where the integration is with respect to the normalized Haar measure on il{G). 

Remark 5 It is worth noting that the average in this theorem is no more over maximally mixed states 
like in Theorem\^ but rather over encoding operations. 

Proof. The proof is easily reduced to that of the corresponding theorem in our previous paper [4]. Most 
of the details can also be seen in the proof of Theorem [7] in the next subsection. 

□ 

3.3 One-Shot Coding Result for Informed Encoder 

Before stating the main result of this section we recall a useful lemma from [4j which will be needed in 
the proof of Theorem [7l 

Lemma 6 Let L and D be N x N matrices with non-negative entries which satisfy 

Lji < Ljj , Lji < Lii , (2) 



and 

for all j,l e {!,..., TV}. Then 



Dji<Taayi{Djj,Dii} (3) 



N 

j-i=i i=i 



Proof. The proof of this lemma is elementary. The details can be picked up in our previous paper [4]. 

□ 

We will focus now on the scenario where the sender or encoder knows which channel is in use. Consequently, 
the encoding operation can depend on the individual channel. The idea behind the next theorem is that 
we perform an independent, randomized selection of unitary encoders for each channel in the finite set 
3 = {Ml, . . . ,Mn}- This explains why the averaging in (|4]) is with respect to products of Haar measures 
instead of averaging over one single Haar measure as in Theorem |4l 
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Theorem 7 (One-Shot Result: Informed Encoder and Averaged Channel) Let the finite- 
dimensional Hilbert spaces TL and K, be given. Consider suhspaces £,Qi, ■ ■ ■ ,Gn C Ti. with dimS — k 
such that for all i ^ {1, . . . , N} the dimension relation k < dim^i holds. Let Afi, . . .Mn G C^{T-C, JC) each 
allowing a representation with Uj Kraus operators, j = 1, . . . , A^. Let C he any fixed set of 

unitary operators such that Vi£ C Qi holds for every i € {1, . . . , iV}. For an arbitrary set {ui}f^i C it(7i), 
define 



Ther 



1 ^ 



u(ei)x...xii(eN) 



1=1 



N ^ 

. . . duN > [-^^^-^ii^Gj)) 



-2^\\Uj{TTg^)\\2 

where the integration is with respect to the product of the normalized Haar measures on il{Qi 



(4) 



Proof. Our first step in the proof is to show briefly that Fc^eiT^£,J^ui....,UN) depends measurably on 
(Mi,...,UAr) e H^iGi) X ... X il{Q^). For each recovery operation TZ £ C{JC,Ti.) we define a function 
fn X ... xil(g^) ^ [0,1] by 

/•r(mi, . . .,un) -Fe(7r£, 7?. o7V„i, ...,„„). 

Clearly, f-fi is continuous for each fixed TZ G C(/C,?i). Thus, the function 



max fniui, . . . ,un) 
■Rec{KM) 



is lower semicontinuous, and consequently measurable. 

We turn now to the proof of inequality From Lemma [2] we know that there is a recovery operation TZ 
such that 

Fe{Tl£,TZoMm,...,UM) >W- llwPae - VUpa ^ p'Jll, (5) 

where we have used the notation introduced in the paragraph preceding Lemma [2l and 



w = vu{ui, . . . ,un) = tr(7V„i, 



For each e {1, . . . , N} let be the set of Kraus operators of A/}. Clearly, for every set ui, . 



, UN 



of unitary matrices, Afj oUj o Vj has Kraus operators given by 



bj^iUjVj. Utilizing the very 



same calculation that was used in the proof of Theorem [?] in [3] , which in turn is almost identical to the 
corresponding calculation in [5^ , we can reformulate inequality (O as 



i^e(7r£,7?.o A/'„^_..._„j^) >w - \\D{ui, 
with vu = tr(7V„i,...,„„(7r£)) and 



(6) 



D{ui,...,un) X! w X! ^{tj)iri)iU],ui)'S)\e^){er\'S>\fj){fi\ 



N 

i,r—l 
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where 



D(ij)(rl){uj,ui) i (^paj^^al^p- ^tr {pa* ,ai^rP)p^ , 



and p := kng is the projection onto £. Let us define 



The triangle inequality for the trace norm yields 

AT 

N 



< —^k min{nj , n; } 1 1 Dj- ; (uj , u/ ) 1 1 2 , 

= X — y/A: min{nj,n;}||£)j-;(uj,uz)||i, (8) 



where the second line follows from ||a||i < \/d||a||2, d being the number of non-zero singular values of a. 
In the next step we will compute ||-Dj,;(uj, We set pi := vipvi which defines new projections {pi}^i 

with supp(p;) C Gi for every I G {1, . . . , N}. A glance at ([7]) shows that 



Uj.ni 



{Dj^i{uj,ui))* = X {D(,j)(^ki){uj,ui))* (E)\ek){e,\®\fi){fjl (9) 

and consequently we obtain 

\\D],i{'^],ui)\\l = iT{{Dj^i{uj,ui))*Dj,i{uj,ui)) 

rij ,ni 

= X! tr((£'(y)(fci)(Uj,U;))*£'fe)(feo('"j''"')) 
j— l,r— 1 

"3 ,ni 



-2 X {tr(p(a*,a;,^)*pa*^a/,^) 



•i^l r— 1 



|tr(pa*,a;,^)| } 



k 

i—lr—1 



1, 



tr(p,;*u*6*,6,^,u,^;OI }• (10) 



It is apparent from the last two lines in ()10|) that ||^j,z(uj,u;)ll2 depends measurably on (ui, . . . ,un) G 
X ... X !d(QN)- Let Ui, . . . ,Un be independent random variables taking values in !d{Gi) according to 
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the normalized Haar measure on il{Gi) {i & {1, ■ • ■ , ^})- Then using Jensen's inequahty and abbreviating 
Lji := k mhi{nj , ni} we can infer from ([8]) that 

^ 1 I 

E(||I?(C/i,...,C/^)||i) < ^ -^L,iEi\\DM,Uim). (11) 

3.1 = 1 



Note that the expectations on the RHS of (|TT|) are only with respect to pairs of random variables 
Our next goal is to upper-bound ¥.{\\Dj^i{Uj, C/;)!!!)- 

Case j ^ I: Since the last term in PTT)) is non- negative and the random variables Uj and Ui are independent 
we obtain the following chain of inequalities: 

Uj ,ni 
i—\.r—l 

-'^niY{pV*U*blfil,rUlVi)\'' 

^ fc2 E ^tTipiu;bi,b,^,u,p,u*biM.rUi) 

i—l.r—l 

rij ,ni 

= E ^tr{Uipiurbi,b,,,u,p,u;b*M 

i—l.r—l 



rij .711 

^ E imuiPiunbi,b,Mu,P,u;)biAr) 



fc2 

i— l,r— 1 

^ rij-ni 
■i^l,r— 1 

= (AGK),M(^g,))H5, (12) 
where ( ■ , ■ )hs denotes the Hilbert-Schmidt inner product, and we used the fact that 

¥.{UipiU^) = k ■ TTg^ and E{UjPjU*) ^ k ■ ttq^. 
Case j = I: In this case we obtain 

n\\D,AU-J,Uj)\\l) = fc2 E ^t<P,U*blrb,,,U,p,U*bl,b,,rU,) 



1 

"k 



Mpv*u;bi,b,,rU,v,)\'' 



-2 E Etr(U,p,U*bl^b,,U,p,U;blfi,,r^ 

i=l,r=l 

-^E\tiiU,p,U;bl,b,,r)\']. (13) 
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Thus, the problem reduces to the evaluation of 

E{bupU'ix,y)}, {x,yeBin)) 
where p is an orthogonal projection with tr(p) ~ k and 

bupU'ix,y) tv{UpU*x*UpU*y) - ^tT{UpU*x*)triUpU*y), 

for a Haar distributed random variable U with values in il{G) where supp(p) d G dH. 
Here we can refer to [5D_ where the corresponding calculation is carried out via the theory of group 
invariants and explicit evaluations of appropriate integrals with respect to row-distributions of random 
unitary matrices. The result is 

P -1 I - P 

E{bupU' {x, y)} = -^p—jtT^{pgx*pgy) + ^(^2 _ -^^ tr(pga;*)tr(pgy), (14) 

for all x,y £ B{'H) where pg denotes the projection onto Q with tr(pg) = d. In Appendix [Al we will give 
an elementary derivation of (fT4|) for the sake of completeness. 
Inserting (|14p with x = y = b*^bj,r into yields with dj := tr(pg^) 

^ _ 1 "i-"] 



~MiPgSA,r)\ 



< 



^ _ 1 "j ."j 

^2 E ^"^(PQ.bl^bj^.pg^bl^bj^r) 

i=l.r=l 



1 v-^ 

- W2 E ^^iPS,blrbj,iPg,bl,bj,r) 

1 v-^ 

= ^2X1 ^^ib3,rPgjblrb3,tPg,bli) 

"-J i=l,r=l 

= (AG-(7re,),Ar,(^e,))Hs- 

Summarizing, we obtain 

Ei\\D,^,{U,,UM) < (AG-K),AGK))ifs = l|AGK)lli (15) 

Similarly 

1 ^ 1 

EitT{J^u,,...,uA^£))) = T7E^(tr(AG([/,-p,C/;))) 



AT 



i-Etr(AG(7rgJ). (16) 



TV 
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®, ®, dm), (Uni), and (HH) show that 

AT 

TV 



TV 

where for j,l G {1, . . . , N} we introduced the abbreviation 

Dji (AG-(7reJ,M(7rc;,))Hs, 

and, as before, 

Lji = kmm{nj,ni}. 

It is obvious that 

Lji < Ljj and Lji < Lu 
hold. Moreover, the Cauchy-Schwarz inequahty for the Hilbert-Schmidt inner product shows that 

< l|AG-K-)ll2||M(7ra,)||2 

< max{||AG(7reJ||i||M(^a,)ll2} 
= max{Djj, £)/;}. 

Therefore, an apphcation of Lemma [5] allows us to conclude from (|17p that 

N 

N 



N 

2Ev^ll-^:'-K)ll2, 



and we are done. 

□ 



3.4 Entanglement Fidelity 

The purpose of this subsection is to develop a tool which will enable us to convert a special kind of 
recovery maps depending on the channel into such that are universal, at least for finite compound channels. 
Anticipating constructions in section 2] below the situation we will be faced with is as follows. For finite 
set 3 = {A/i, . . . , A/jv} of channels, block length I e N, and small e > we will be able to find one single 
recovery map 7?.' and a unitary encoder W' such that for each i e {1, . . . , N} 

Feinr, , 7^' o Qi^, o A/;®' o W') > 1 - e, 
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where Qi,i{-) ■— qLi{-)qi,i with suitable projections qi^i acting on /C®'. Thus we will effectively end up with 
the recovery maps TZ\ :~ Ti) o Q/ ^. Consequently, it turns out that the decoder is informed. Lemma |8] 
below shows how to get rid of the maps Qi^i ensuring the existence of a universal recovery map for the 
whole set 3 while decreasing the entanglement fidelity only slightly. 

Lemma 8 Let p G S{Ti.) for some Hilbert space TL. Let, for some other Hilbert space IC, A (z C(7i, K.), V G 
C(/C,7i), q e B{IC) be an orthogonal projection. 

1. Denoting by the completely positive map induced by q^ := Ijc — q we have 

Feip,VoA) > Fe{p,V o Qo A)il - 2Fe{p,V o o A)). (18) 

2. Lf for some e > the relation F^^p^V oQo^) >1 — e holds, then 

F,{p,V o o A) < e, 

and U8\} implies 

Fe{p,V o A) > {1 - e){l - 2e) > 1 - '3e. (19) 

3. Lf for some e > merely the relation tr{qA{p)} > 1 — e holds then we can conclude that 

Feip,VoA)>F,{p,VoQoA)~2e. (20) 

The following Lemma [S] contains two inequalities one of which will be needed in the proof of Lemma [Sj 
Lemma 9 Let D G C(/C,7i) and xi _L X2, zi _L Z2 be state vectors, xi,X2 G /C, zi,Z2 G Then 

\{Z^M\XI){X2\)ZI)\ < ^\{zi,Vi\x,)(xi\)zi)\ ■ \{Z,,V{\X2){X2\)ZI)\ < 1, (21) 

and 

\{Z1,V{\XI)(X2\)Z2)\ < ^\(zi,V{\xi){xi\)zi)\ ■ \{Z2,V{\X2){X2\)Z2)\ < 1. (22) 

We will utilize only (PT|) in the proof of Lemma [S] But the inequality (|^^ might prove useful in other 
context so that we state it here for completeness. 

Proof of Lemma \^ Let dimTY h, dim/C = n. Extend {xi,X2} to an orthonormal basis 
{xi,X2, . . . ,Xk.} of /C and {21,^2} to an orthonormal basis {zi, Z2, . . . , Zh} on Ti.. Since xi _L X2 and 
Zi _L Z2, this can always be done. By the theorem of Choi [6], a linear map from B{'H) to B{IC) is com- 
pletely positive if and only if its Choi matrix is positive. Write I?(|xi)(xj|) — 1=1 -^nl-^k) {zi\. Then 
the Choi matrix of V is, with respect to the bases {xi, . . . , Xk} and {zi, . . . , z^}, written as 

K h 
ij = l k,l = l 

If CHOI(P) is positive, then all principal minors of CHOI(I?) are positive (cf. Corollary 7.1.5 in [T7]) and 
thus 

m < ^/\D^,\-\Din 
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for every suitable choice of i,j,k,l. Thus 



\{z,\V{\x,){x2\)z2)\ - \Dll\ 



< d\Dll\-\DM\ 



= V\{Z1M\X1){X1\)Z,)\ ■ \{Z2M\X2){X2\)Z2)\, 

and similarly 

\{zi,V{\xi){x2\)zi)\ < ^\{zi,V{\xi){xi\)zi)\ ■ \{ZIM\X2){X2\)Z1)\. 

The fact that T) is trace preserving gives us the estimate {zi,'D{\xj){xj\)zi) < 1 suitably chosen) and 
we are done. 

□ 

Proof of Lemma\^ Let dimTi = h, dim/C = k, G ® Ti he a, purification of p (w.l.o.g. 

7i(j = 7i). Set V := id-Ua ®T^: A :— id-j-c^ ^ A, q :— ® Q Siud, as usual, q-^ the orthocomplement of q 
within Ha <8) IC. Obviously, 

F,ip,VoA) = {^,VoAmmi^) 

= vi[q + q^]Ammq + q^]))^) 

+{^b,v{qAm{m'-W + {^Mq^MWimW 
> {i^,viqAm{m)^) + m{i^MqA\i^){M)q^m} 

= Feip,VoQoA)-2\{^,V{qAW{m^m- (23) 
We establish a lower bound on the second term on the RHS of Let 

K-h 



1=1 



where {ai, . . . are assumed to form an orthonormal basis. Now every Ui can be written as = 

aiXi + Piiii where Xi G supp(q) and yi e supp((7-'-), i e h}, are state vectors and ai,Pi £ C. 

Define a := (-01), then 

K-h 

^^Y. + la;,) (% I + P,a*\y,){x,\ + W\y,){y,\). (24) 
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Set X := \{^j,'b{qA{\i^){m^m- Then 

K-h 
K-h 

= \J2>^^^^P:{^M\^^){y^\m 
K-h 

< J2\x,a.p:\■\{i,,f>{\^.){y^\m 

i=l 

< |]|v/A,|(V',P(k,)(a;,|)V')a,v/A,(V',P(|y,)(y,|)V)A*l 

K-h K-h 

< ^A.|a.|2(^,P(|x.)(x.|)^)^A,|/3,f (V',P(|%)(%|)V>. (25) 
i=i j=i 

Here, a follows from using the convex decomposition of AQip) , b from utilizing inequality (|2ip from 
Lemma [9] and c is an application of the Cauchy-Schwarz inequality. 
Now, employing the representation (|24|) it is easily seen that 

K-h 

1=1 

and similarly 

K-h 

F,{p,v o Qo A) = Y.XM\'{^M\yj){yAW- (27) 

The inequalities j!!]), dM]), (ES]), and (gS]) yield 

F,{p,VoA) > Fe{p,VoQoA)~2F,{p,VoQoA)Fe{p,VoQ^oA) 

= Feip,VoQoA)il-2Fe{p,VoQ^ oA)) (28) 

which establishes p8)). 

Let us turn now to the other assertions stated in the lemma. Let tr{qA{p)} > 1 — e. This implies 
tr(g-'-^(p)) < e. A direct calculation yields 

tr(q^o-) = trnA^^)ci{'i-na<»q^)idn^<»A{\ij){^p\))) 

= tr^(Q^^(tr„„(|V')(^|))) 

= tT^{q^A{p)) 

< e. 



19 



Using ([M)) . we get the useful inequality 



e > 



Y,MP^\M<i^\y^){y^\) 



K-h 



(29) 



Using Lemma inland ((^ we get 



X 



< 



i=i j=i 



thus by equation (j23p we have 



F,{p,VoA)>Fe{p,VoQoA)~2€. 



In case that Fg^p,!) o Q o A) > 1 — e, we note that the linear maps Q and Q-^ are elements of C^{IC,IC) 
whilst Q + Q-*" G C(/C,/C) and since i^e is affine in the operation 



4 Direct Part of The Coding Theorem for Finitely Many Chan- 
nels 

4.1 Typical Projections and Kraus Operators 

In this subsection we recall briefly the well-known properties of frequency typical projections and reduced 
operations. A more detailed description can be found in 4 and references therein. 



F,{p,VoQoA)+F,{p,VoQ^oA)^Fe{p,Vo{Q+Q^)oA) < 1 




F,{p,VoA) > Fe{p,VoQoA){l~2F,{p,VoQ^oA)) 

> (l-e)(l-2e) 

> l~3e, 



which is the claim we made in (I19p . 



□ 
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Lemma 10 There is a real number c > such that for every Hilbert space Ti there exist functions 
/i : N — > R+, ip : (0,1/2) — > R_|- with lim;^oo /j(0 ~ and limg^Q ip(S) = such that for any p € 
S{TL), S e (0, 1/2), I e N there is an orthogonal projection qsj G S(7i)®' called frequency-typical projection 
that satisfies 

1. tr{p®\s^i) > i_2-'('='5'-''«), 
The inequality 2. implies 

\\qs,lp'''qsA\l<2-'^'^'^-^^'^^- 
Moreover, setting d diniTY, (p and h are given by 

h{l) = J log{l + 1) V/ e N, ip{S) -<51og - V(5 e (0, 1/2). 

Lemma 11 Let 7i,/C be finite dimensional Hilbert spaces. There are functions 7 : (0,1/2) h' : 

N ^ R+ satisfying lims^oj{S) = and h'{l) \ such that for each N € C{n,JC), S € (0, 1/2), / e N 
and maximally mixed state ng on some subspace Q CZ TC there is an operation Ms,i G C^(7i®', /C®') called 
reduced operation with respect to J\f and irg that satisfies 

1. tr{jVs,i{ng^)) > 1 - 2^'(^'*^"''''(')), with a universal positive constant c' > 0, 

2. Ms,i has a Kraus representation with at most ns^i < 2'^'^=('^s.-^)+7(>5)+'i (;)) ^^^j^^ operators. 

3. For every state p G 5(7^®') and every two channels I e {Ti'^^Ti'^^) and C G CJ-(/C®',H®0 the 
inequality Fe{p, C o JVs,i °T) < Fe{p, C o A/"®' o T) is fulfilled. 

Setting d :— diniTi and n dim/C, the function h' : N M+ is given by h'{l) — ^ log(Z + 1) G N and 
7 byj{S) = -5\og^, V<5e (0,1/2). 

4.2 The Case of Uninformed Users 

Let us consider a compound channel given by a finite set 3 :— {A/i, . . . jMn} C C(7Y,/C) and a subspace 
G C Ti.. For every Z e N, we choose a subspace £1 C t?®'. As usual, tt^, and ng denote the maximally 
mixed states on respectively G while fc; := dim 5; gives the dimension of £1. 

For j e {1,...,N}, 5 G (0, 1/2), I e N and states A/'j(7re) let qjj^i e ^(/C)®' be the frequency-typical 
projection of Mji'Kg) and Mj^s.i be the reduced operation associated with A/} and vrg as defined in Subsec. 
01 

These quantities enable us to define a new set of channels that is more adapted to our problem than the 
original one. We set for an arbitrary unitary operation e S(7i®') 

and, accordingly, 

1 ^ 
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We will show the existence of good codes for the reduced channels Qj,s,i °-^j,5,i in the limit of large I £ N. 
An application of Lemma [8] and Lemma [TT] will then show that these codes are also good for the original 
compound channel. 

Let [/' be a random variable taking values in which is distributed according to the Haar measure. 

Application of Theorem d] yields 

N 

EF,,,(7r£„A^^,_,) > tr(A^i(^f))-25^v/fc;^||^',,(7r|')||2, (30) 
where nj g i stands for the number of Kraus operators of the reduced operation Mj^s.i (j G {I7 • ■ • 7 N}) and 

Notice that Qj^s^i o Mj^s,i trivially has a Kraus representation containing exactly rij^s^i elements. We will 
use inequality (|30p in the proof of the following theorem. 

Theorem 12 (Direct Part: Uninformed Users and \3\ < 00) Let 3 = {Afi, ...,AfN} C C{H,IC) be a 
compound channel and ng the maximally mixed state associated to a subspace Q C H. Then 

Q{3) > min Ic{Trg,Mi). 

Proof. We show that for every e > the number minjv^g^ /^(Trg;, A/i) — e is an achievable rate for 3. 

1) If minjv;ga /c(7rg,A/i) — e < 0, there is nothing to prove. 

2) Let minTV'isa /c(7rg, A/i) — e > 0. 

Choose S £ (0, 1/2) and Iq £ N satisfying ^{S) + (p{6) + h'{lo) < e/2 with functions 7, (p, h' from Lemma 
[10] and [m 

Now choose for every / G N a subspace £1 C t/®' such that 

By S{ng) > Ic{ng,J\fj) (see ^1^), this is always possible. 
Obviously, 

min Ic{ng,Ni) - e - o{f ) < - logh < rain Ic{ng,f>/i) - e. 
We will now give lower bounds on the terms in ()30|) . thereby making use of Lemma [TOl and Lemma [TT] 

tr(A^^(^f)) > 1 - 2-'(=*'-''(')) - 2-'(='*"-'''(')). (31) 

A more detailed calculation can be found in [3] or [^. Further, and additionally using the inequality 
+ > + \\B\\l valid for non-negative operators A,B £ B{]C®^) (see [20]), we get the inequality 

\WlA^f)\\l ^ 2-'(^(^^("s))-¥^(5))^ (32) 
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From ([50]) . ([HT|) . ([5^ and om' specific choice of fc/ it follows that 



TV 



V2'(Tl°g'=i+7('5)+v(-5)+'>'(i)--fc('re,AG) 

J = l 



-2iV\/2-'(e-7('5)-v('5)-ft.'(0) . 

Since e — 7(5) — (p{S) — > e/2 for every ^ > Iq, this shows the existence of at least one sequence of 
{I, fc/)— codes for J with uninformed users and 

liminf - log A:/ = min iJi^g^Mi) — e 

as well as (using that entanglement fidelity is affine in the channel), for every I g N, 

min Fein^^n' oM'oW')>l- N^ei (33) 

]&{!.,. ..,N} ■'■ 3 

where € il(^;®') G N and 



= 3 • (2-'('='5' + 2-'(='*'-'''(')) + 2ivV2-'(f-7('5)-v('5)-v(0)). (34) 

Note that lim/^00 = exponentially fast, as can be seen from our choice of 5 and ^o- For every 
j S {1, . . . , N} and / G N wc thus have, by property 3. of Lemma [TTl construction of A/j g, and equation 

(ESI), 



Fe{TT:F, , 7^' o Qj^sA o AAf o W') > Fe(^^, , 7^' o Q^^sa o A^,- 5,, o W') 



Fe(^^,,7^'oA^' ,,) 
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min FeiTTjr^ , 7^' o TV®' o w') > 1 - Nei V/ e N. (35) 



□ 



By the first two parts of Lemma [8l this immediately implies 

min i^e(7rjr, , 7^' o A/"®' o w') > ] 

Since e > was arbitrary, we have shown that min^viea /c(7rg, A/i) is an achievable rate. 

4.3 The Informed Encoder 

In this subsection we shall prove the following Theorem: 

Theorem 13 (Direct Part: Informed Encoder and \3\ < 00) For every finite compound channel 
3 = { Ai , . . . , A/at } C C{TL,1C) and any set {tt^^, . . . , Trg^} of maximally mixed states on subspaces 
{Qi, ■ ■ ■ , Qn} with Gi CH for all i G {1, . . . , N} we have 

Qie{3) > min /c(7I■g,,7Vi)• 
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Proof. Let a compound channel be given by a finite set J := {A/i, . . . , A/at} C C{H, JC) and let ^i, ... , Qm 
be arbitrary subspaces of TL. We will prove that for every e > the value 

i?(e) := min Ic{ng.,J\fi) - e 

l<i<N 

is achievable. If R{e) < 0, there is nothing to prove. Hence we assume i?(e) > 0. For every I S N and all 
i £ {!,..., N} we choose the following. First, a subspace £i C H®' of dimension ki := dimf; that satisfies 
ki < dimQf'K Second, a set {v[, . . . of unitary operators with the property w'f/ C C/f ^ Again, the 
maximally mixed states associated to the above mentioned subspaces are denoted by tt^, on £/ and ttq. 
on Gi. 

For j € {1, . . .,N}, 6 € (0, 1/2), I G N and states Nj{TTg.) let qj,s^i £ B{1C)®^ be the frequency-typical 
projection ol Njij^g-) and A/j/,i be the reduced operation associated with Nj and irg. as considered in 
section 14.11 

Let, for the moment, ? € N be fixed. We define a new set of channels that is more adapted to our problem 
than the original one. We set, for an arbitrary set {u\, . . . , m^v} of unitary operators on H®' 



and, accordingly, 

i=i 

we will first show the existence of good unitary encodings and recovery operation for {M[ ■ ■ ■ , A/'a? s)- 
Like in the previous subsection, application of Lemma [5] will enable us to show the existence of reliable 
encodings and recovery operation for the original compound channel 3. 

Let U[,...,U\q be independent random variables such that each Ul takes on values in il(^f') and is 
distributed according to the Haar measure on {i £ {1, ■ • . By Theorem [7] we get the lower 

bound 

EF,,,{7r£„Aflj,_^j,^^^) > ^[^^^'^(-^^^('^ef' )) - 2v/fc;^||A^j^5(7r5«0ll2], (36) 

i=i 

where nj^s,i denotes the number of Kraus operators in the operations Mj,s.i {j £ {1, • ■ • , N}). By Lemmas 
I10|lll for every j £ {1, . . . , N} the corresponding term in the above sum can be bounded from below 
through 

ltr(^j^,(^g«0) > ^(1 - 2-'(^^'-''«) - 2-'(-'*'-'''('))) 

and 

-2^/h^lWls{TTg^i)\\2 > ~2\lkl ■ 2'(-«""l<.<«^=(-S,.A/-.)+7W+'P(5) + '«'(0), 
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Set ki [2'«('=)J . Obviously, for any j G {1, . . . , TV}, 

ki ■ 2'(^"""i2j^" ■'^'^('^Oj^-'^j)) < 2-'<^ 

This implies 

-2iV\/2'(-<^+'y('5)+¥'('5)+'''W). 

Now choosing both the approximation parameter 5 and an integer E N such that —e+^{S)+(p{S)+h'{l) < 
— ie holds for every I > Iq and setting 

6/ 2"'('"'^~''(')) + 2-'('='''^-'''W) + 27Va/2'(-<^+t('5)+¥'('')+'''W) 

we see that 

where again e; \ and our choice of 6 and Iq again shows that the speed of convergence is exponentially 
fast. Thus, there exist unitary operators w{, . . . ,w^j^ C and a recovery operation TZ' such that, 

passing to the individual channels, we have for every j e {!,..., N} 

Fe{7T£, , TZ' o Qj^sj o Afj,sj oW!j)>l-Nei. 

By property 3. of Lemma [TT] and Lemma 13 we immediately see that 

Fe(7^£„7^'oAAf' oWj) > l-3Nei Vj G {1, . . . , iV} 

is valid as well. We finally get the desired result: For every set {tt^j, . . . ,Trg^} of maximally mixed states 
on subspaces Gi, ■ ■ ■ ,Gn C H and every e > there exists a sequence of {I, ki) codes for 3 with informed 
encoder with the properties 

1. liminf;^oo jlogfc; = minA/-^.g3 /^(Trg;^. , A^j) - e, 

2. minAf^e^^^e(^£,,7^'oAAf' oWj) >l^?,Nei. 
Since e > was arbitrary and e/ \ 0, we are done. 

□ 

5 Finite Approximations in the Set of Quantum Channels 

Our goal in this section is to discretize a given set of channels 3 e C(7i, /C) in such a way that the results 
derived so far for finite sets can be employed to derive general versions of coding theorems for compound 
channels. 

The first concept we will need is that of a r-net in the set C{H,IC) and we will give an upper bound on the 
cardinality of the best r-net in that set. Best r-nets characterize the degree of compactness of C(7i, K,). 
A r-net in C(?i,/C) is a finite set {Afi}^i with the property that for each Af G C(7i,/C) there is at least 
one i e {1, . . . , N} with IjA/" — A/i||o < r. Existence of r-nets in C{H, JC) is guaranteed by the compactness 
of C(7i,/C). The next lemma contains a crude upper bound on the cardinality of minimal r-nets. 
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Lemma 14 For any r G (0, 1] there is a T-net {M}^i inC{n, K.) with N < (|)^('*■'^')^ where d = diiaU 
and d' = dim/C. 

Proof. The assertion of the lemma fohows from the standard volume argument (cf. Lemma 2.6 in [24]). 
The details can be found in our previous paper [4 . 

□ 

Let 3 C C{n,IC) be an arbitrary set. Starting from a r/2-net m := with N < (|)2('i rf')' as in 

Lemma [TH we can build a r/2— net 3'^ that is adapted to the set 3 given by 

a; -.^ {^f^ e m : 3Af e 3 with ||AA-M||o < r/2}, (37) 

i.e. we select only those members of the T/2-net that are contained in the r/2-neighborhood of 3. Let 
T e C{H,IC) be the useless channel given by T{p) := gj^^lx:, P G S{H), and consider 

3r -.^ {{I -lW+lT:Afe 3'^}, (38) 

where 3'^ is defined in For 3 C C{n,IC) we set 

for /5 S S{'H). We list a few more or less obvious results in the following lemma that will be needed in the 
following. 

Lemma 15 Let 3 C C{H,IC). For each positive t < ^ let 3r be the finite set of channels defined in i38]) . 
1- \3r\ < (f )2('' ''')' with d ^ dimU and d' ^ dim/C. 

2. For Af e3 there is Ni £ 3r with 

\\Af^' -Af^^' Wo < It. (39) 

Consequently, for N , and any CPTP maps V : B{T) B{n)^^ and U : B{JC)'>^^ -> B{P) the 
relation 

\F,{p,'Ro M®'' oV) - F.ip^Tlo oV)\ <It (40) 
holds for all p G 5(7^®') and I G N. 

3. For all p G S{Ti.) we have 

\Up,3) ~ Ic{p,3r)\ <T + 3rlog-. (41) 

T 

Proof. The proofs of the assertions claimed here are either identical to those given in [4j or can be obtained 
by trivial modifications thereof. 

□ 
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6 Direct Parts of the Coding Theorems for General Quantum 
Compound Channels 

6.1 The Case of Informed Decoder and Uninformed Users 

The main step towards the direct part of the coding theorem for quantum compound channels with 
uninformed users is the foUowing theorem. 

Lemma 16 Let 3 e C(7Y,/C) be an arbitrary compound channel and let -Kg be the maximally mixed state 
associated with a subspace Q (ZTi. Then 

Proof. We consider two subspaces 5;, t/®' of TL®^ with 8i C Q®^ C 7i®'. Let ki := dimf; and we denote as 
before the associated maximally mixed states on £i and Q by Trg, and ivg. 

If infAAea Ic{TTg,JV) < there is nothing to prove. Therefore we will suppose in the following that 

inf lJTTg,J\f) > 

holds. We will show that for each e e (0,infA/'ga Ic{ng,J\f)) the number 

mi^Ic{T:g,Af) -e 

is an achievable rate. 

For each Z e N let us choose some t; > with ti < ^, lim;^oo Iti = 0, and such that AT^, grows sub- 
exponentially with I. E.g. we may choose t; := min{l/e, We consider, for each I e N, the finite 
set of channels 3r; := {A/i, . . . ^Mn^^} associated to 3 given in ([38|) with the properties listed in Lemma 
[TSl We can conclude from the proof of Theorem [T2l that for each I e N there is a subspace Ti C G®'' of 
dimension 

ki = [2'(™™ie{i N^} ■f<:('^s.A/'d-f)j^ (42) 

a recovery operation TZ, and a unitary encoder such that 

min Fe(7r^,,7^oA/:®'o W') > l-iV,,^ (43) 

{ 1 , . . . , A^T^ } 

where ei is defined in ([34| (with the approximation parameter e replaced by e/2), and we have chosen 
l,lo e N with I > Iq large enough and S > small enough to ensure that both 

min Ici-Kg.Ni) ^ o > 0; 

iG{l,...,JV^,} 2 

and 

^-^^{5)~^{5)~h\h)>e/^>Q. 
By our construction of 3ri we can find to each A/" € U at least one Mi G with 

|Fe(7r^, ,110^1^0 W') - Fe(7r^, , 7^ o AA®' o W')! < r rz (44) 



27 



according to Lemma [T51 Moreover, by the last claim of Lemma [T51 we obtain the following estimate on 
the dimension ki of the subspace J-/: 

ki> ^2^(^'^^^<^^ ^':(^S'^)~i~^i-^'^'^°eT^ ^ (-45) 

The inequalities ([321) and (gj) show that 

inf Feinr, , 7^ o AA®' o W') > 1 - Nr.ei - hi, 

which in turn with (I45p shows that infA^ea Ic{Trg,M) is an achievable rate. 

□ 

In order to pass from the maximally mixed state ng to an arbitrary one we have to employ the compound 
generalization of Bennett, Shor, Smolin, and Thapliyal Lemma (BSST Lemma for short) from and jl6| . 
For the proof of this generalized BSST Lemma we refer to [4J. 

Lemma 17 (Compound BSST Lemma) Let 3 C C(7i,/C) be an arbitrary set of channels. For any 
p E S{H) let qs^i € B{H^'') be the frequency-typical projection of p and set 

Then there is a positive sequence ((5i)igN satisfying limj^oo 5i — with 

lim ) inf /,(^5,.i,AA®') = inf Ic{p,J^). 

With these preparations it is easy now to finish the proof of the direct part of the coding theorem for the 
quantum compound channel with uninformed users. 
First notice that for each A; G N 

gp®'^) = kQ{3) (46) 

holds. For any fixed p G S{H®"^) let qsj G B{H®™^) be the frequency-typical projection of p and set 
T^s.i = t T'' \ ■ Lemma [TBI implies that for any 6 E (0, 1/2) we have 

Q(a®™') >/,(^5,i,3»"'), (47) 
for all m,l E N. Utilizing (|46p . (|T7)) and Lemma [T71 we arrive at 

Q{3) = - lim iQ(3®™') 

TO l~>oo L 

> - lim 1 inf /.(tt,,,,, (AA®™)®') 

- l/,(p,3®™). (48) 

TO 

From (|48l) and since Qid{3) > Q{3) trivially holds we get without further ado the direct part of the 
coding theorem. 
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Theorem 18 (Direct Part: Informed Decoder and Uninformed Users) Let 3 C C(7i,/C) be an 
arbitrary set. Then 

Q/Dp) > QP) > lim i max ini Up, Af^^). (49) 

Remark 19 It is quite easy to see that the limit in l^gp exists. Indeed it holds that 

max inf IJp,N®^+^) > max inf lJp,N®^) 

+ max inf lJp,M®'') 

which implies the existence of the limit via standard arguments. 

6.2 The Informed Encoder 

The main result of this section wiU rely on an appropriate variant of the BSST Lemma. To this end we 
first recall Holevo's version of that result. For (5 > 0, ^ e N, and p £ S{H) let qs^i £ B{H^^) denote the 
frequency typical projection of p®'. Set 

Moreover, let 

Amin(/3) := min{A G a{p) : A > 0}, 
where (j{p) stands for the spectrum of the density operator p. 

Lemma 20 (BSST Lemma [2], |16] ) For any 6 £ (0, 2 dim h ) ' ^ C(7i,/C), and every p £ S{Ti.) 

with associated state irg.i = t^s.i{p) € iS(7i®') we have 



<6i{5,\^Up),^minW{p))) (51) 



where 



A ' 1—/ 

ei{S,Xmin{p),Xmin{M{p))) = log(/ + 1) - dim 7^ • <5 log ^ 

-dimn-S- (log A^i„(p) + log A^i„(AA(p))). (52) 

Before we present our extended version of BSST Lemma we introduce some notation. For t £ (0, i) and 
any set 3 C C{7i,JC) let us define 



t(*) — 



{AA(*) = (1 - t)JV + tTK: -.JV £3} = (l- t)3 + tT^:, (53) 



where T £ C{nX) is given by r,c{x) I^Ik- 

On the other hand, to each M £ 3 £1 C{Ti.,IC) we can associate a complementary channel Afc £ C{Ti,Tie) 
where we assume w.l.o.g. that TLe = C^™'^ '^™'^. Let 3' C C{H,He) denote the set of channels comple- 
mentary to 3 and set 

:= (J')(*) = {AA(*) = (1 - t)Af, + tTn^ : K e 3'} = (1 - t)3' + tTn^ , (54) 
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where T-u^ E C{Ti,, He) is the defined in a similar way as Tic- Finally, ior J\f E 3 let 



Pa^ := arg max Ic{p,N), 



and for t e (0, -), 6 > 0, and I e N define 



(*) f it)\ 



where we have used the notation from (|50p and 



(55) 



(56) 



Lemma 21 (Uniform BSST-Lemma) 1. Let I e N, t e (0, ■^), and 5 G (0, jdH^)- '^^^'^ '^^^^ 
notation introduced in the preceding paragraph we have 



\ inf /,(7rf,V,AA®') - inf max Ic{pM) 



with 



-4t log 



t 



t 



dim 7i ' dim /C 

t 



dim tC ■ dim Tie 



+ 2(?i yd, - 

- 2lt log 



t 



t 



diniTi' dim Tie 
It 



dim /C • Tie ' 



where 9i {5, and 9i [s, are /rom LemmaM 

2. Consequently, choosing suitable positive sequences {Si)i^jq, (t;)'6N with 

1. lim/^00 5; = = lim/_>oo Iti, and 

2. lim/^00 dl log t; = 

we see that for vi :— Ai{Si,ti) 



1 



I A^ea 



inf /e(4*',\.,A/'®') - inf max Ic(p,^f) 



holds with lim;^oo vi = 0. 



(57) 



Proof. Our proof strategy is to reduce the claim to the BSST Lemma [20l Let t > be small enough to 
ensure that I ■ t G {0, ^) and let S G (0, 2 dim n ) given. From and ([5^ we obtain that 



Amin(AA(*)(p)) > 



t 



dim/C ' 



A.„i„(A((*Hp)) > 



dim Tie 



V P e 5(Ti), 



and ((56)) yields that 



Amin(pjv''') > -T7 



dimTi 



(58) 
(59) 



30 



for all N el. The bounds ([55]) and along with Lemma show that 



and 



i5((AA(*))«'(7ra^))-5(AA(*)(p(^))) 



dim ' dim K, 



t 



t 



dim Ti dim Ti. 



On the other hand, by definition we have 

||AA(*' - A^llo < i, ||(AA(*))®' - AA®'||o < r 

and similarly 

I |AA« - AA,| lo < t, II (A^i*))®' - AA®' I lo < / • t, 
for all A/" e 3. Since / • t G (0, -j) we obtain from this by Fannes inequality 

\S{M^'\p^f^))-S{M{p^^))\<-t\og^-^, 



and 



as well as 



' dim Tie 



< -I -tlog 



< -l-t log 



l-t 

dim K, ' 

l-t 
dim Tie ' 



for all A/" € 3. Since 
and 



hip^^M) = S{N{p^;^)) - S{N.{p^^)) 



(60) 
(61) 

(62) 
(63) 

(64) 
(65) 

(66) 

(67) 



the inequalities (pO)) .(pT |) . ([M)) . (pS)) . (p7)) and triangle inequahty show that uniformly in A/" G 3 we 

have 



< 



S, 



t 



t 



^ilog — 



dim Ti ' dim /C 
t 



t 



t 



dim K. ■ dim ?i . 



r i log — 



dim 7i dim Tie 
l-t 



dim /C • Tie 



(68) 



Now, by ([561) we have 
which implies 



||pj^^-p^||i<i 
||AA(p(^V A/'(paa)IIi < t, \WrXp^^)~M.{pu)\\i < t, 
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since the trace distance of two states can only decrease after applying a trace preserving completely positive 
map to both states. Thus Fannes inequality leads us to the conclusion that 



ic{p'^^,U)-ic{p^M) 

This and (|68p shows that uniformly in A/" G J 



< -t log — ■ 



t 



dim/C • dim Tie 



t 



t 



'2t\og — 



dim Ti ' dim JC 

t 



+ Oi{d, 



t 



t 



dim K ■ dim T-L„ 



l-t\og — 



dim TL ' dim TLg. 
I ■ t 



dim K ■ Tie 



Finally, it is clear from the uniform estimate in ()69p that 



(69) 



\ inf /cCvrf.^jv-^A/'®') - inf max Ic{p,J^) 



which concludes the proof. 



1 



^?IIc{4UM^') - inf /c(pA.,AA) 



(70) 



□ 



Lemma [5T] and Theorem 1131 easilv imply the following result. 
Lemma 22 Let J C C(7i,/C) 6e an arbitrary set of quantum channels. Then 

QieO) > inf max Ic{p,JV). 

Proof. Take any set {Trg^jTv'ea of maximally mixed states on subspaces G Ti.. In a first step we will 
show that 

Q/b(3)> mf^/c(7rg^,AA) (71) 
holds. Notice that we can assume w.l.o.g. that inf^/^^j Ic{TTgj^,Af) > 0. 

Denote, for every r > 0, by 3,- a r-net for J as given in ([38| of cardinality Nr := \3r \ < (|)2('i '^') , where 
d,d' are the dimensions of 7i, /C. Starting from this set 3r it is easy to construct a finite set 3° with the 
following properties: 

1. 3° C 3, 

2. \3°\ < {^?'-'^"^">\ and 

3. to eachWe 3 there is at least one TV' e 3° with ||7V-7V'||o < 2t. 

Let (T/)jgN be defined by t; := ^ and consider the sets 3°^, I E N. 
Take any 77 € (0, infA/'gji /c(7rg^, A/")) and set 



^l*?) _mf/c(7re^,AA) -77, 
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and 



Then for every / € N, 

Riiv) > R{v) (72) 

since 3°^ C 3. 

Fix some S' e (0, 1/2) such that ^{6') + if {6') < rj/A. For every I g N, choose a subspace £i C Ti®^ of 
dimension 

ki{-q) :=dimfi = [2'''''''^]. 
The proof of Theorem [T3] then shows the existence of a recovery operation Ti} and for each M' G 3° a 

Fe(7r£,,7^' o AA'®' o Wj^,) > 1 - 3 • A^r, • e; V AA' e 3°,, 



unitary encoder Wif, such that for each / € N 



where := 2-'('='5"-''(0) +2-'(='''"-'''('» +2A^^, V 2'(-t'+'''W)). From Lemma[l5]along with the properties 
of and our specific choice of (Ti);gN it foUows that there exist unitary encodings Wj^ (for every / G N 
and each A/" G J) , such that 

Fe(7^£:,,7^' o A/"®' o Wj^) > 1 - 3 • 7V^, • e/ - y V/ e N, A/" e 3. 

Clearly, lim,^oo -F'e (tt^, , 7^' oTV®' o Wj^) = 1 and ^ implies for each -q e (0,infjv-g3 /^(Trg;^, TV)) that 

liminf — \ogki{rj) ~ liminf — logdimf; > R{r]). 

I — >oo I I — 'OO / 

Consequently vaij^szj Ic{'Kgj^ ,N) is achievable. 

We proceed by repeated application of the inequality 

Qie{3) > jg/Bp®') (V/ e N). (73) 

From (ffTjl and ([73|l we get that for each / e N and every set {7r^}7v'G3 of maximally mixed states on 
subspaces of H®' , 

Q/i;(3) > y jnf/c(^V,A/'^')- 

We now make a specific choice of the states tt^, namely, for every A/" G J and / e N, set vr^ :— T^sl'j j\f 

with T^sl'lj^ taken from the second part of Lemma [^T] By an application of the second part of Lemma [H] 
it follows 

Qie{3) > lim i inf I,{tt'_^,J^^') 

> lim ( inf Idpj^M) ^ ^i) 
i^oo j\fed 

— inf max I J P.N). 

Afe3 pesiH) 
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□ 

Employing inequality ([75)1 one more time we obtain from Lemma applied to 

> i inf max IJp,M®^). 
Consequently we obtain the desired achievability result. 

Theorem 23 (Direct Part: Informed Encoder) For any 3 E C(7i,/C) we have 

QieO) > lim \ inf max /c(p,7V®'). (74) 

Remark 24 Note that the limit in exists. Indeed, set 

Ci{Af):^ max h{pM®'). 

Then it is clear that 

Ci+k{J^) > CiiAf) + CkiAf) 



and consequently 



inf Ci+k {M) > inf {Ci (JV) + Ck (M) ) 



> inf Ci(Af)+ inf Cfc(AA), 



which implies the existence of the limit in |7^p . 



7 Converse Parts of the Coding Theorems for General Quantum 
Compound Channels 

In this section wc prove the converse parts of the coding theorems for general quantum compound channels 
in the three different settings concerned with entanglement transmission that are treated in this paper. 
The proofs deviate from the usual approach due to our more general definitions of codes. 



7.1 Converse for Informed Decoder and Uninformed Users 

We first prove the converse part in the case of a finite compound channel, then use a recent result [22j 
that gives a more convenient estimate for the difference in coherent information of two nearby channels 
in order to pass on to the general case. 

For the converse part in the case of a finite compound channel we need the following lemma that is due 
to Devetak TOl: 
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Lemma 25 (Cf. \10\ ) For two states a, p E S{Hi^H2) where dini'Hi(E)'H2 = b with fidelity f = F{a, p), 

\AS{p)-AS{a)\ < ^+41og(6)yW, 

where 

We shall now embark on the proof of the following theorem. 

Theorem 26 (Converse Part: Informed Decoder, Uninformed Users, |3| < oo) Let 3 = 

{A/i, . . . , A/at} C C(Ti,IC) be a finite compound channel. The capacities Qid{^) and Q(3) of J 
with informed decoder and uninformed users are bounded from above by 

QO) < QidO) < lini max min ^lJp,JV,^^). 

Proof. The inequality Q{3) < Q{'3)id is obvious from the definition of codes. We give a proof for the 
second inequality. Let for arbitrary I G N an {I, ki) code for a compound channel 3 = {Afi, . . . , A/at} with 
informed decoder and the property mini<i<Ar Fe{'Kjr^ , TZ\ o Af^'' o 7") > 1 — e/ be given, where e; G [0, 1]. 
Let \ipi){ijji\ G S{£i (g) J^i) be a purification of njr^ where £; is just a copy of J^i. We use the abbreviation 
2?' := Tf E^i ° K^^- Obviously, the above code then satisfies 

1 ^ 

> l-e;. (75) 

Let a-pi :— idg^ '^V''{\'ip^){ip''\) and consider any convex decomposition a-pi = Ylit™'^'^ of api 

into pure states |ei)(ei| G 5(JF/ ^ H®'). By ([75]) there is at least one i G {1, . . . , (dim J^;)^} such that 

{■:Pi,id£,(g)V'i\e^){e,\)^i) > 1 - e/ (76) 

holds. Without loss of generality, i = 1. Turning back to the individual channels, we get 

{^i,id£,®'R'^oMf'{\ei){ei\)iPi)>l-Nei Vz G {1, . . . , TV}. (77) 

We define the state := tr^j (|ei)(ei|) G 5(7^*^') and note that |ei)(ei| is a purification of pK Application 
of recovery operation and individual channels to p' now defines the states (t[ := ids, ® 7^1 °-^^'(|ei)(ei|) 
(A: G {1, . . . , N}) which have independently of k the property 

^^(V'',^)- (7/^^,^d£,®7^1oAAf (|e,)(e,|)V/) >l~Nei 

and thus put us into position for an application of Lemma 1251 which together with the data pro- 
cessing inequality for coherent information [55] establishes the following chain of inequalities for every 
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ke{l,...,N}: 

log dim = <S'(vrjc-j) 

e 

= S{tTe,i^de,®nioM^^i\e,){e,\)) S{^de, ® TZ[ o M^\\e^) {e^\)) 

+ -+41og((dim^0')V^ 

e 

- /,(p^7^i o AAf ) + - +41og((dim^0')V^ 
e 

< /,(p',AAf ) + ^+41og((dim^O')0V^- (78) 
logdim^j < min I,{p\Uf) + -+4log{{dimJ^if)y^i 

k£{l,...,N} e 

< max min Up,J\ff) + -+8\og(dimJ'i)JN7i. (79) 

Let a sequence of {l,ki) codes for 3 with informed decoder be given such that hminf;^oo jlogdimJF; = 
ReM. and hm;^oo £; = 0. Then by we get 

B = hm inf — log dim JT; 

I — >oo I 

< lim inf— max min Ic(p,Aft 

l^oo I peSiH»')ke{l,...,N} 

1 2 

+ lim inf - — h lim inf 8 log(dim J^/)-\/ Nei 

I — ^oo / 6 I — >oo 



Thus, 



lim ^ max min iJpMf^), 
i^oa I pes{ne")ke{i,...,N} 



□ 



Let us now focus on the general case. We shall prove the following theorem: 

Theorem 27 (Converse Part: Informed Decoder, Uninformed Users) Let 3 C C(?i,/C) be a 
compound channel. The capacities QidC^) cind Q{J) for 3 with informed decoder and with uninformed 
users are bounded from above by 

QO) < QidO) < lim max inf \lJpM'^^)- 

For the proof of this theorem, we will make use of the following Lemma: 

Lemma 28 (Cf. [22]) Let JV, Aft E C(7i,/C) and d/c = dim/C. Let Ti,,. be an additional Hilbert space , 
leN and<pe S(Hr(E>n^^). //||AA-M||o < e, then 

ISiidn^^jV^'i^)) ~ S{idn^®Mf\m < l{4e\og{d^) + 2h{e)). 

Here, h(-) denotes the binary entropy. 
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This result immediately implies the following Lemma: 

Lemma 29 Let TL, JC be finite dimensional Hilbert spaces. There is a function v : [0, 1] — > R+ with 
lima;^o = such that for every 3,3' C C(7i, /C) with D0{3,3') < r < 1/2 and every I Cz N we have 
the estimates 



1. 



,1 



,,inf max IJp,N®^)- \ inf max IJp,N'®^)\<u{2t) 

The function v is given by ^{x) — x + Sxlog^djc) + 4/i(x). Again, h{-) denotes the binary entropy. 

Proof of Theorem \27\ Again, the first inequality is easily seen to be true from the very definition of codes 
in the two cases, so we concentrate on the second. Let 3 C C(7i,/C) be a compound channel and let for 
every / e N an {l,ki) code for 3 with informed decoder be given such that liminf;_,(x) jlogfc; — R, and 
lim,^oo inf AAea (tt^, o o V') = I hold. 

Take any < r < 1/2. Then it is easily seen that starting with a ^-net in C{Ti,K,) wc can find a set 
a; = {TVi, . . . ,AfN^} C 3 with \Nr\ < (l)2(dimH-dim/c)^ ^^^^1 that for each JV e 3 there is G 3'^ with 



IIAA-Mllo <T. 

Clearly, the above sequence of codes satisfies for each i € {1, . . . , TVt-} 

1. liniinf(^oo j logfc; = R, and 

2. limi^oominA/-,ga^ Fe(7r^,,7^' o A/-®' o T'O = 1. 
From Theorem 1261 it is immediately clear then, that 

R< lim max min -iJpM?^) 
and from the first estimate in Lemma we get by noting that Do{3, 3'^) < t holds 

i?< lim max M -Up,Af'^^) + v(2t). 
Taking the limit r ^ proves the theorem. 
7.2 The Informed Encoder 

The case of an informed encoder can be treated in the same manner as the other two cases. We will just 
state the theorem and very briefly indicate the central ideas of the proof. 

Theorem 30 (Converse Part: Informed Encoder) Let 3 C C{H,IC) be a compound channel. The 
capacity Qie{3) for 3 with informed encoder is bounded from above by 

Qie(3) < lim \ inf max LJp^Af^^). 
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Proof. The proof of this theorem is a trivial modification of the one for Theorem [^T] Again, the first part 
of the proof is the converse in the finite case, while the second part uses the second estimate in Lemma 



For the proof in the finite case note the following: due to the data processing inequality, the structure of 
the proof is entirely independent from the decoder. A change from an informed decoder to an uninformed 
decoder does not change our estimate. The only important change is that there will be a whole set 
{el_^, . . . , ef^} of vector states satisfying equation ((76|) . one for each channel in 3. This causes the state 
in equation ([75)1 to depend on the channel. 



8 Continuity of Compound Capacity 

This section is devoted to a question that has been answered only recently in |22| for single-channel 
capacities, namely that of continuity of capacities of quantum channels. 

The question is relevant not only from a mathematical point of view, but might also have a strong impact 
on applications. It seems a hard task in general to compute the regularized capacity formulas obtained 
so far for quantum channels. There are, however, cases where the regularized capacity formula can be 
reduced to a one-shot quantity (see for example [S] and references therein) that can be calculated using 
standard optimization techniques. 

Knowing that capacity is a continuous quantity one could raise the question how close an arbitrary 
(compound) channel is to a (compound) channel with one-shot capacity and thereby get an estimate on 
arbitrary capacities. 

We will now state the main result of this section. 

Theorem 31 (Continuity of Compound Capacity) The compound capacities Q{ ■ ), Qid{ ■ ) and 
Qie{ ■ ) a^e continuous. To be more precise, let 3,3' C C(7i,/C) be two compound channels with 
D(){3,3') < e < 1/2. Then 



where the function v is taken from Lemma [2^ 

Remark 32 Let 3 C C(7i,/C). Then D(3,3) ~ 0, implying that the three different capacities of 3 coincide 
with those for 3. We may thus define the equivalence relation 3 ^ 3' <^ 3 — 3' and even use D(y as a 
metric on the set of equivalence classes without losing any information about our channels. 

Proof. Let D^{3,3') < e. By the first estimate in Lemma and the capacity formula (3/d(3) = Q{3) = 
linif^oo J niaxpg5(„»i) I^p, 3®') we get 



m 



□ 



\Qi3) - Qi3')\ = \Qid{3) - Qid{3')\ < z^(2e), 
\QiEi3)-QiEi3')\ <K2e), 



\Q{3)-Q{3')\ 



= \Qid{3)-Qid{3')\ 



< lim i^(2e) 




= ;.(2e). 
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For the proof in the case of an informed encoder let us first note that QieC^) = 
Um;_»oo infA/'ea niaxpg5(-^®i) /c(p, A/"®') holds. The second estimate in Lemma [29l justifies the following 
inequality: 

\QiEi^)-QiE{y)\ = |limi[inf max /,(p,AA®') - inf max AA'®')]| 

= lim |i inf max Up,Af'^^)-\mf max Up,Af"^')\ 
i^oo' I Afe3 peS{H«") I Af'ed pesiH'S') 

< lim i^{2e) 

I — >oo 

= i/(2e). 

□ 



9 Entanglement-Generating Capacity of Compound Channels 

In this last section wc will use the results obtained so far to achieve our main goal. Namely, we will 
determine the entanglement-generating capacity of quantum compound channels. We give the definitions 
of codes and capacity only for the most interesting case of uninformed users because there is no doubt 
that the reader will easily guess the definitions in the remaining cases. Nevertheless, we will state the 
coding result in all three cases. 

An entanglement-generating (/,fc;)-code for the compound channel 3 C C(7i,/C) with uninformed users 
consists of a pair (pi) where 7?.' G C(/C®', J^/) with ki = dim and ipi is a pure state on JF; (g) 7i®'. 
R e M-i- is called an achievable rate for 3 with uninformed users if there is a sequence of {l,ki) entanglement- 
generating codes with 

1. liminf;^oo j logfc; > R, and 

2. limi^oo'i'^iMe^ P{\i'i){i'i\:{^djr^'^TZ^ oJ\f'^^){\ipi){ipi\)) — 1 where V"; denotes the standard maximally 
entangled state on Ti (E) Ti and F{-,-) is the fidelity. 

The entanglement-generating capacity of 3 with uninformed users is then defined as the least upper bound 
of all achievable rates and is denoted by E{3). The entanglement-generating capacities Ejd(3) and Eie{3) 
of J with informed decoder or informed encoder are obtained if we allow the decoder or preparator to 
choose TZ^ or ipi in dependence oi M £ 3. 

Recall from the proof of Theorem[Tn]that to each subspace G CH and e > we always can find a subspace 
J^i C e?®' C H'^\ a recovery operation 7^' G C{JC®\Ti), and a unitary operation G C(H®',H®0 with 

h = dimTl > L2'(''^fAr«/c(7rg,A/-)-f-o(i«))j^ ^gQ) 

and 

^nf_^Fe(7r^,,7^' o AA®' oZ^') = 1 - o{f). (81) 

Notice that the maximally entangled state V'z in ® J^i purifies the maximally mixed state ttjt, on Ti and 
defining |(^/)((^;| :— , the relation ([8T|) can be rewritten as 

inf F{\4,i){'4,i\,id:F,®'R' oM''^\\^i){^i\)) = l-o{f). (82) 
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This together with ([50]) shows that 

i?p)> mf^ 7,(^5, AA). (83) 

Thus, using the compound BSST Lemma [T71 and arguing as in the proof of Theorem [T51 we can conclude 
that 

E(3) >Q(J)= Km i max inf Up,N®^). (84) 
Since E(3) < Eid{3) holds it suffices to show 

Eid{^) < Q(3) = lim i max inf Up,Af'^^) (85) 
in order to establish the coding theorem for Ejd{3) and E{3) simultaneously. 

The proof of ([55)1 relies on Lemma [^5] and the data processing inequality. Indeed, let R S K+ be an 
achievable entanglement generation rate for 3 with informed decoder and let ((7?,^)jvg3, be a 
corresponding sequence of {I, fc;)-codes, i.e we have 

1. liminfj^oo j logfc; > R, and 

2. mi^f^:, F{\il;i){ilji\, (idjr^ (g) TZ'j^ o Af'^^){\(pi){(pi\)) = 1 - e/ where lim;_>oo = and ipi denotes the 
standard maximally entangled state on J-i (g) J-i with Schmidt rank ki . 

Set := tr^,(|(^i)((^/|) and 

aV:-^d^,®7^VoAA®'(|^^)(^^|). 
Then the data processing inequality and Lemma [25] imply for each A/" G U 

/c(p',AA®') > /,(p',7eVoAA®') 
= A(aV) 

> A(|V'i)(^i|)---81og(A:0v^ 
e 

2 

logfc; 81og(A;/)v/Q. 

e 

(1 - 8^i)\ logki < J max inf /c(p,AA«') + ^ (86) 



Consequently, 
and we end up with 



R< lim sup- log fc; < lim - max inf lAp, 



which implies (j85p . The expression for Ej e (3) is obtained in a similar fashion. We summarize the results 
in the following theorem. 

Theorem 33 (Entanglement- Generating Capacities of 3) For arbitrary compound channels 3 C 
C(7Y, K.) we have 

E(3) = EidP) = Q(3) = lim \ max inf Up,N®^), 

and 

£;7i3(3) = Qie{'2) = lim y inf max UpM®')- 
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10 Conclusion and Further Remarks 



We have demonstrated that universal codes in the sense of compound quantum channels exist, and we 
determined the best achievable rates. The results are analogous to those well known related results from the 
classical information theory obtained by Wolfowitz [52], [33], and Blackwell, Breiman and Thomasian [S]. 
In contrast to the classical results on compound channels there is, in general, no single-letter description of 
the quantum capacities for entanglement transmission and generation over compound quantum channels. 
Notice, however, that for compound channels with classical input and quantum output (cq-channels) a 
single- letter characterization of the capacity is always possible according to the results of [3] . 
Natural candidates of compound quantum channels that might admit a single-letter capacity formula are 
given by sets of quantum channels consisting entirely of degradable channels. While it is quite easy to see 
from the results in ^ that the degradable compound quantum channels with informed encoder have a 
single-letter capacity formula for entanglement transmission and generation, the corresponding statement 
in uninformed case seems to be less obvious. This and related questions will be addressed in a future 
work. 

Another issue we left open in this paper is the relation of the capacities considered here to other quantum 
communication tasks, for example to the subspace transmission and average subspace transmission and 
even to the randomized versions thereof. Again, we hope to come back to this point at some later time. 



A Appendix 

Let £ and Q be subspaces of Ti. with £ d Q d Ti. where k dim£, dg := dimQ. p and pg will denote the 
orthogonal projections onto £ and Q. For a Haar distributed random variable U with values in il{G) and 
x,y B(Ti.) we define a random sesquilinear form 

bupU'{x,y) tv{UpU*x*UpU*y) - ^iv{UpU*x*)tv{UpU*y). 

In this appendix we will give an elementary derivation of the formula 

P -I 1- P 

E{bupU'ix,y)} = 7tr(pgx*pg;?;) -f — — 2 —tT{pgX*)tT{pgy) (87) 

dg-l dg{dg-l) 

for all x,y £ B(Ti) and where the expectation is taken with respect to the random variable U . 
Let us set 

pu UpU*. 

Since iv{pux*puy) and tT:{pux*)iv{puy) depend sesquilinearly on {x,y) G B{T-L) x Biji.) it suffices to 
consider operators of the form 

a; = |/i)(.gi| and y-|/2)(ff2| (88) 

with suitable /i,/2,5i,<?2 G 'H. With x^y as in we obtain 

tr{pux*puy) = {fi,Puf2){g2,Pu9i) 

= {fi®92,iU®U){p®p){U* ®U*)f2®9i), (89) 

and 

tl{puX*)tl{puy) = tr((p,y(g)p,y)(|5i)(/i|(g)|/2)(.g2)) 

= {fi®92AU®U){p®p)iU*®U*)gi®f2). (90) 



41 



Since the range of the random projection ([/ (g) U){p®p){JJ* ®U*) is contained in Q ®Q we see from 
and ([QO]) that we may (and will) w.l.o.g. assume that /i,/2,5i,52 S Q. Moreover, ((89)) and ((90|) show, 
due to the linearity of expectation, that the whole task of computing the average in ([57| is boiled down 
to the determination of 

A{p) := V.{{U ®U){p®p){U* ®U*)) 

{u®u){p®p){u*®u*)du. (91) 



/u(e) 

Obviously, A{p) \s u® u-invariant, i.e. A{j)){u ® u) — {u® u)A{p) for all u G •U(S). It is fairly standard 
(and proven by elementary means in [30] ) that then 

A{p) = alls + Plia, (92) 

where lis and IIq denote the projections onto the symmetric and antisymmetric subspaces oiQ®Q. More 
specifically 

ns:=^(id + F) = i(id-F), 

with id(/ g) = / ® 5 and F(/ ® g) = g /, for aU /, 5 G ^. 

Since Ilg and Ha are obviously u ® u-invariant, and IlsIIa = HqIIs = holds, the coefficients a and [3 in 
dnH) are given by 

a = ^tr((p«p)n.) = J^^fiiP^msl (93) 

and 

^tr((p«p)n.) = ^^(^tr((,«p)n.), (94) 
where dg = dim Q and we have used the facts that 

tr(n,) = dimran(n,) = + 

and 

tr(n,) = dimran(n,) = 
It is easily seen by an explicit computation with a suitable basis that 

tr((p®p)n,) = i(P + fc) and tr((p ® p)n,) = ^(P - fc). (95) 

For example choosing any orthonormal basis {ei, . . . , e^g} of Q with ei, . . . , Cfc G ran(p) we obtain 

dg 

tr((p(g)p)ns) = (e, (g) ej, (p (g) p)nse, (g) ej) 

fc 

= ^ (e^ (g Cj, (p g) _p)nsei g) Cj) 
1 

= i(fc2 + fc), 
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with a similar calculation for tr((p (g) p)na). Utilizing ([M]) . ([M]) . (|95p . and ([M]) we end up with 



fc2 + fc 



n, + 



fc2 - fc 



(96) 



dg{dg + 1) 



{dg - 



1) 



Now, jMl), (EI]), ((901), ([89]), and some simple algebra show that 




4^ 



tr(a;*y) 



tr(a;*)tr(y). 



+ 



dg{dl-l) 
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